NSF PAR Search | NSF Public Access Repository

Learning Safe Action Models with Partial Observability

https://doi.org/10.1609/aaai.v38i18.29995

Le, Hai S; Juba, Brendan; Stern, Roni (March 2024, Proceedings of the AAAI Conference on Artificial Intelligence)

A common approach for solving planning problems is to model them in a formal language such as the Planning Domain Definition Language (PDDL), and then use an appropriate PDDL planner. Several algorithms for learning PDDL models from observations have been proposed, but plans created with these learned models may not be sound. We propose two algorithms for learning PDDL models that are guaranteed to be safe to use even when given observations that include partially observable states. We analyze these algorithms theoretically, characterizing the sample complexity each algorithm requires to guarantee probabilistic completeness. We also show experimentally that our algorithms are often better than FAMA, a state-of-the-art PDDL learning algorithm.
Full Text Available
Safe Learning of Lifted Action Models

https://doi.org/10.24963/kr.2021/36

Juba, Brendan; Le, Hai S.; Stern, Roni (September 2021, Proceedings of the 18th International Conference on Principles of Knowledge Representation and Reasoning)

Creating a domain model, even for classical, domain-independent planning, is a notoriously hard knowledge-engineering task. A natural approach to solving this problem is to learn a domain model from observations. However, model learning approaches frequently do not provide safety guarantees: the learned model may assume actions are applicable when they are not, and may incorrectly capture actions' effects. This may result in generating plans that will fail when executed. In some domains such failures are not acceptable, due to the cost of failure or inability to replan online after failure. In such settings, all learning must be done offline, based on some observations collected, e.g., by some other agents or a human. Through this learning, the task is to generate a plan that is guaranteed to be successful. This is called the model-free planning problem. Prior work proposed an algorithm for solving the model-free planning problem in classical planning. However, it was limited to learning grounded domains, and thus it could not scale. We generalize this prior work and propose the first safe model-free planning algorithm for lifted domains. We prove the correctness of our approach, and provide a statistical analysis showing that the number of trajectories needed to solve future problems with high probability is linear in the potential size of the domain model. We also present experiments on twelve IPC domains showing that our approach is able to learn the real action model in all cases with at most two trajectories.
Full Text Available
Precision-Recall versus Accuracy and the Role of Large Data Sets

https://doi.org/10.1609/aaai.v33i01.33014039

Juba, Brendan; Le, Hai S. (January 2019, Proceedings of the AAAI Conference on Artificial Intelligence)

Full Text Available
Conditional Sparse Lp-norm Regression With Optimal Probability

Hainline, John; Juba, Brendan; Le, Hai S.; Woodruff, David (January 2019, Proceedings of Machine Learning Research)

Full Text Available
Conditional Sparse L_p-norm Regression with Optimal Probability

Hainline, John; Juba, Brendan; Le, Hai S.; Woodruff, David P. (January 2019, AISTATS)

Full Text Available
